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Amendments' to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 
Listing of Claims: 

1 . (Currently amended) A method for retrieving information using a search engine 
comprising the steps of: 

(a) retrieving a document to be indexed; 

(b) generating a document extract corresponding to the document by extracting a 
portion of the document that characterizes the document's subject content to form the document 
extract ; 

(c) decomposing the document extract into a plurality of tokens; and 

(d) storing the plurality of tokens in a search index, wherein the search engine 
accesses the search index to retrieve information in one or more document extracts satisfying a 
search query. 

2. (Currently amended) The method of claim 1, wherein the generating step (b) further 
comprises the steps of: 

(bl) extracting a portion of th e docum e nt that characterizes the document's 
subj e ct content to form the docum e nt e xtract; and 

(b3) — recording positional information of the portion extracted within the 

document. 

3. (Currently amended) The method of claim 31, further comprising the step of: 
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*(e) storing the document extract in a storage device. 

4. (Currently amended) The method of claim 3-2, wherein the storing step (d) further 
comprises: 

(dl) storing the recorded positional information with the plurality of tokens. 

5. (Currently amended) The method of claim 41, wherein the e xtracting generating step 
(W-) further comprises the step of: 

(bli) extracting from the document a collection of sentences that are 
characteristic of the document's subject content to form a document summary. 

6. (Currently amended) The method of claim 41, wherein the decomposing step (c) 
further comprises: 

(cl) selecting from the document extract one of a whole sentence, a portion of 
a sentence, a word, and a feature. 

7. (Original) The method of claim 6, wherein the selecting step (cl) further comprises: 

(cli) selecting based on frequency of occurrence, word-salient-measure, 
proximity to the beginning of a paragraph, proximity the beginning of the 
document,andproximity to or position within a heading or a caption. 

8. (Original) The method of claim 1, wherein the document is a web-page in the Internet. 
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'9. (Currently amended) A computer readable medium containing programming 
instructions for retrieving information using a search engine comprising the instructions for: 

(a) retrieving a document to be indexed; 

(b) generating a document extract corresponding to the documen t by extracting a 
portion of the document that characterizes the document's subject content to form the document 
extract : 

(c) decomposing the document extract into a plurality of tokens; and 

(d) storing the plurality of tokens in a search index, wherein the search engine 
accesses the search index to retrieve information in one or more document extracts satisfying a 
search query. 

10. (Currently amended) The computer readable medium of claim 9, wherein the 
generating instruction (b) further comprises the instructions for: 

(bl) extracting a portion of th e docum e nt that characteriz e s th e document's 
subj e ct cont e nt to form th e docum e nt e xtract; and 

(b3) — recording positional information of the portion extracted within the 

document. 

1 1 . (Currently amended) The computer readable medium of claim ^9, further comprising 
the instruction for: 

(e) storing the document extract in a storage device. 

12. (Currently amended) The computer readable medium of claim 4410, wherein the 



4 



Attorney Docket: DE920000094US1/2265P 

storing instruction (d) further comprises the instruction for: 

(dl) storing the recorded positional information with the plurality of tokens. 



13. (Currently amended) The computer readable medium of claim 439, wherein the 
e xtracting generating instruction (Wb) further comprises the instruction for: 

(Wibl) extracting from the document a collection of sentences that are 
characteristic of the document's subject content to form a document summary. 

14. (Currently amended) The computer readable medium of claim 439, wherein the 
decomposing instruction (c) further comprises the instruction for: 

(cl) selecting from the document extract one of a whole sentence, a portion of 
a sentence, a word, and a feature. 

15. (Original) The computer readable medium of claim 14, wherein the selecting 
instruction (cl) further comprises the instruction for: 

(c 1 i) selecting based on frequency of occurrence, word-salient-measure, 
proximity to the beginning of a paragraph, proximity the beginning of the 
document, and proximity to and position within a heading and a caption. 

16. (Original) The computer readable medium of claim 9, wherein the document is a 
web-page in the Internet. 

17. (Currently amended) A system for retrieving information, wherein the system includes 
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a search engine comprising: 

means for retrieving a document from a document repository; 

an information extractor coupled to the means for retrieving, wherein the information 
extractor generates a document extract corresponding to the documen t by extracting a portion of 
the document that characterizes the document's subject content to form the document extract ; 

a storage device coupled to the information extractor for storing the document extract; 

a search engine indexer coupled to the storage device for decomposing the document 
extract into a plurality of tokens; and 

a search index coupled to the search engine indexer for storing the plurality of tokens, 
wherein the search engine accesses the search index to retrieve information in one or more 
document extracts satisfying a search query. 

18. (Currently amended) The system of claim 17, wherein the information extractor 
e xtracts a portion of the docum e nt that characterizes tho docum e nt's subject cont e nt to form the 
document e xtract, and records positional information of the portion extracted within the 
document. 

19. (Original) The system of claim 18, wherein the search index stores the positional 
information associated with the plurality of tokens. 

20. (Currently amended) The system of claim 4-917, wherein a token of the plurality of 
tokens comprises one of a whole sentence, a portion of a sentence, a word, and a feature of the 
document. 
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21 . (Currently amended) The system of claim 3017, wherein the search engine indexer 
selects the plurality of tokens based on frequency of occurrence, word-salient-measure, proximity 
to the beginning of a paragraph, proximity the beginning of the document, and proximity to and 
position within a heading and a caption. 

22. (Original) The system of claim 17, wherein the document respository is the Internet 
and the document is a web-page. 

23. (Original) The system of claim 22, wherein the means for retrieving the document is 
a web crawler. 
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